Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 117350 |
| Missing cells | 182737 |
| Missing cells (%) | 6.2% |
| Duplicate rows | 9198 |
| Duplicate rows (%) | 7.8% |
| Total size in memory | 180.7 MiB |
| Average record size in memory | 1.6 KiB |
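The percentage and average rows above can be re-derived from the raw counts. A minimal sketch of that arithmetic, assuming missing cells are measured against every cell (observations × variables) and that the report rounds to one decimal place; the variable names are illustrative and not part of the report:

```python
# Raw figures copied from the "Dataset statistics" table.
n_variables = 25
n_observations = 117350
missing_cells = 182737
duplicate_rows = 9198
total_size_mib = 180.7

# Assumption: missing cells are measured against all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_observations * n_variables)

# Duplicate rows are measured against the number of observations.
duplicate_pct = 100 * duplicate_rows / n_observations

# Average record size in KiB (1 MiB = 1024 KiB).
avg_record_kib = total_size_mib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_kib:.1f} KiB")
```

Under these assumptions the three derived values round to the figures shown in the table.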
Variable types
| CAT | 16 |
|---|---|
| NUM | 9 |
Reproduction
| Analysis started | 2020-02-27 02:53:28.546766 |
|---|---|
| Analysis finished | 2020-02-27 02:58:43.303706 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| Dataset has 9198 (7.8%) duplicate rows | Duplicates |
| description has a high cardinality: 108122 distinct values | High cardinality |
| designation has a high cardinality: 34149 distinct values | High cardinality |
| province has a high cardinality: 406 distinct values | High cardinality |
| region_1 has a high cardinality: 1190 distinct values | High cardinality |
| title has a high cardinality: 73194 distinct values | High cardinality |
| variety has a high cardinality: 674 distinct values | High cardinality |
| winery has a high cardinality: 15595 distinct values | High cardinality |
| description_desc has a high cardinality: 108109 distinct values | High cardinality |
| aspects has a high cardinality: 106339 distinct values | High cardinality |
| positive_score is highly correlated with neutral_score | High correlation |
| neutral_score is highly correlated with positive_score | High correlation |
| taster_twitter_handle is highly correlated with taster_name | High correlation |
| taster_name is highly correlated with taster_twitter_handle | High correlation |
| designation has 34602 (29.5%) missing values | Missing |
| price has 8131 (6.9%) missing values | Missing |
| region_1 has 19027 (16.2%) missing values | Missing |
| region_2 has 70191 (59.8%) missing values | Missing |
| taster_name has 23022 (19.6%) missing values | Missing |
| taster_twitter_handle has 27658 (23.6%) missing values | Missing |
| polarity_score has 12056 (10.3%) zeros | Zeros |
| negative_score has 88537 (75.4%) zeros | Zeros |
| positive_score has 17207 (14.7%) zeros | Zeros |
country
Categorical
| Distinct count | 43 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 53 |
| Missing (%) | < 0.1% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| US | 50799 | 43.3% | |
| France | 18949 | 16.1% | |
| Italy | 17412 | 14.8% | |
| Spain | 5737 | 4.9% | |
| Portugal | 4925 | 4.2% | |
| Chile | 4033 | 3.4% | |
| Argentina | 3600 | 3.1% | |
| Austria | 3022 | 2.6% | |
| Germany | 2095 | 1.8% | |
| Australia | 1946 | 1.7% | |
| Other values (33) | 4779 | 4.1% |
Length
| Max length | 22 |
|---|---|
| Mean length | 4.431580741 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 24 | 57.1% | |
| Uppercase_Letter | 17 | 40.5% | |
| Space_Separator | 1 | 2.4% |
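The table above appears to count the distinct characters of the column grouped by Unicode general category (24 lowercase letters, 17 uppercase letters and 1 space, 42 in total). A minimal sketch of how such a breakdown could be computed with Python's standard `unicodedata` module; the helper name, the category-name mapping and the sample values are assumptions for illustration, not the report's own implementation:

```python
import unicodedata
from collections import Counter

# Long names for a few Unicode general-category codes (illustrative subset).
CATEGORY_NAMES = {
    "Ll": "Lowercase_Letter",
    "Lu": "Uppercase_Letter",
    "Zs": "Space_Separator",
    "Nd": "Decimal_Number",
    "Pd": "Dash_Punctuation",
    "Po": "Other_Punctuation",
}

def distinct_chars_by_category(values):
    """Count the distinct characters in `values`, grouped by general category."""
    distinct = set("".join(values))
    counts = Counter()
    for ch in distinct:
        code = unicodedata.category(ch)
        # Fall back to the two-letter code when no long name is mapped.
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Hypothetical sample, not the full column.
sample = ["US", "France", "New Zealand"]
for name, count in distinct_chars_by_category(sample).most_common():
    print(name, count)
```

Each character is counted once no matter how often it occurs, which matches totals as small as 42 for a column with 117350 rows.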
| Value | Count | Frequency (%) | |
| Latin | 41 | 97.6% | |
| Common | 1 | 2.4% |
| Value | Count | Frequency (%) | |
| ASCII | 42 | 100.0% |
description
Categorical
| Distinct count | 108122 |
|---|---|
| Unique (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| This zesty red has pretty aromas that suggest small red berry, blue flower and a whiff of moist soil. The vibrant palate offers sour cherry, pomegranate and a hint of anise alongside zesty acidity and refined tannins. | 3 | < 0.1% | |
| Gravenstein apple, honeysuckle and jasmine aromas show on the relatively boisterous nose of this bottling from a large vineyard on Highway 46 east of Paso Robles. There is compellingly grippy texture to the sip, with ripe flavors of pear and honeydew melon. A salty acidity takes it to the next level. | 3 | < 0.1% | |
| Cigar box, café au lait, and dried tobacco aromas are followed by coffee and cherry flavors, with barrel spices lingering on the finish. The wood gets a bit out front but it still delivers enjoyment. | 3 | < 0.1% | |
| Ripe plum, game, truffle, leather and menthol are some of the aromas you'll find on this earthy wine. The tightly wound palate offers dried black cherry, chopped sage, mint and roasted coffee bean alongside raspy tannins that leave a mouth-drying finish. | 3 | < 0.1% | |
| Seductively tart in lemon pith, cranberry and pomegranate, this refreshing, light-bodied quaff is infinitely enjoyable, both on its own or at the table. It continues to expand on the palate into an increasing array of fresh flavors, finishing in cherry and orange. | 3 | < 0.1% | |
| Testarossa's annual best-barrel offering starts with broad aromas of violets, blackberry tea and anise on the edges. It's a lighter Pinot density-wise, very soft and expressive with purple fruits, black olive and a fennel-powered acidify on the finish. | 2 | < 0.1% | |
| Subtle aromas of black-skinned fruit and baking spice waft out of the glass. The straightforward palate evokes black plum, vanilla and tobacco while assertive tannins provide support. Drink through 2020. | 2 | < 0.1% | |
| This fresh, crisp wine hovers attractively between crisp, herbal Sauvignon and riper yellow fruits. Touches of wood adds richness to this already-drinkable wine. Drink now or better, from 2017. | 2 | < 0.1% | |
| The variety is unmistakable, with notes of dried herb, dark cherry and espresso. The jammy fruit flavors are rich, with lightly grainy tannins and coffee notes on the finish. It shows a fine sense of balance. | 2 | < 0.1% | |
| This is a soft wine, fruity without great definition. A ripe, red fruit character contrasts with a bitter acidity. It is ready to drink. The blend is 50/50 Merlot and Cabernet Sauvignon. | 2 | < 0.1% | |
| Other values (108112) | 117325 | > 99.9% |
Length
| Max length | 829 |
|---|---|
| Mean length | 243.3071751 |
| Min length | 20 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 54 | 38.8% | |
| Uppercase_Letter | 33 | 23.7% | |
| Other_Punctuation | 15 | 10.8% | |
| Decimal_Number | 10 | 7.2% | |
| Math_Symbol | 4 | 2.9% | |
| Modifier_Symbol | 3 | 2.2% | |
| Control | 3 | 2.2% | |
| Dash_Punctuation | 3 | 2.2% | |
| Space_Separator | 3 | 2.2% | |
| Initial_Punctuation | 2 | 1.4% | |
| Other values (9) | 9 | 6.5% |
| Value | Count | Frequency (%) | |
| Latin | 88 | 63.3% | |
| Common | 51 | 36.7% |
| Value | Count | Frequency (%) | |
| ASCII | 87 | 92.6% | |
| Punctuation | 7 | 7.4% |
designation
Categorical
| Distinct count | 34149 |
|---|---|
| Unique (%) | 41.3% |
| Missing | 34602 |
| Missing (%) | 29.5% |
| Memory size | 1.8 MiB |
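The "Unique (%)" figure above (41.3%) is not the distinct count divided by all 117350 rows, which would give about 29.1%. It is consistent with dividing by the non-missing rows only. A minimal sketch of that reading, with a hypothetical helper name and figures copied from the report:

```python
def unique_pct(distinct_count, n_rows, n_missing):
    """Distinct values as a percentage of the non-missing rows."""
    return 100 * distinct_count / (n_rows - n_missing)

n_rows = 117350

# (distinct count, missing count) pairs copied from the report.
columns = {
    "designation": (34149, 34602),
    "region_1": (1190, 19027),
    "title": (73194, 0),
}

for name, (distinct, missing) in columns.items():
    print(f"{name}: {unique_pct(distinct, n_rows, missing):.1f}%")
```

The same formula reproduces the reported 1.2% for region_1 and 62.4% for title, so the assumption holds for columns with and without missing values.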
| Value | Count | Frequency (%) | |
| Reserve | 1835 | 1.6% | |
| Estate | 1269 | 1.1% | |
| Reserva | 1181 | 1.0% | |
| Riserva | 658 | 0.6% | |
| Estate Grown | 603 | 0.5% | |
| Barrel sample | 375 | 0.3% | |
| Dry | 344 | 0.3% | |
| Crianza | 327 | 0.3% | |
| Estate Bottled | 298 | 0.3% | |
| Vieilles Vignes | 295 | 0.3% | |
| Other values (34139) | 75563 | 64.4% | |
| (Missing) | 34602 | 29.5% |
Length
| Max length | 95 |
|---|---|
| Mean length | 11.58517256 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 60 | 41.4% | |
| Uppercase_Letter | 41 | 28.3% | |
| Other_Punctuation | 14 | 9.7% | |
| Decimal_Number | 10 | 6.9% | |
| Control | 2 | 1.4% | |
| Other_Symbol | 2 | 1.4% | |
| Close_Punctuation | 2 | 1.4% | |
| Math_Symbol | 2 | 1.4% | |
| Space_Separator | 2 | 1.4% | |
| Open_Punctuation | 2 | 1.4% | |
| Other values (6) | 8 | 5.5% |
| Value | Count | Frequency (%) | |
| Latin | 101 | 69.7% | |
| Common | 44 | 30.3% |
| Value | Count | Frequency (%) | |
| ASCII | 86 | 94.5% | |
| Punctuation | 4 | 4.4% | |
| Geometric Shapes | 1 | 1.1% |
points
Real number (ℝ≥0)
| Distinct count | 21 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88.53304644226672 |
|---|---|
| Minimum | 80 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 80 |
|---|---|
| 5-th percentile | 84 |
| Q1 | 86 |
| median | 88 |
| Q3 | 91 |
| 95-th percentile | 93 |
| Maximum | 100 |
| Range | 20 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.040643014 |
|---|---|
| Coefficient of variation (CV) | 0.03434472365 |
| Kurtosis | -0.3136172064 |
| Mean | 88.53304644 |
| Median Absolute Deviation (MAD) | 2.492405116 |
| Skewness | 0.03014917539 |
| Sum | 10389353 |
| Variance | 9.24550994 |
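Several rows in the quantile and descriptive tables for points are derived from others. A short sketch of those relationships, using only figures copied from the tables above (the variable names are illustrative):

```python
# Figures copied from the tables for "points".
mean = 88.53304644
std = 3.040643014
minimum, maximum = 80, 100
q1, q3 = 86, 91

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range

print(f"CV={cv:.6f} variance={variance:.4f} range={value_range} IQR={iqr}")
```

The computed values agree with the reported CV, variance, range and IQR up to rounding of the inputs.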
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 80. 81.5 82.5 83.5 84.5 ... 95.5 96.5 97.5 98.5 100. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 88 | 15441 | 13.2% | |
| 87 | 15019 | 12.8% | |
| 90 | 14060 | 12.0% | |
| 86 | 11119 | 9.5% | |
| 89 | 11058 | 9.4% | |
| 91 | 10568 | 9.0% | |
| 92 | 9017 | 7.7% | |
| 85 | 8372 | 7.1% | |
| 93 | 6149 | 5.2% | |
| 84 | 5636 | 4.8% | |
| Other values (11) | 10911 | 9.3% |
| Value | Count | Frequency (%) | |
| 80 | 337 | 0.3% | |
| 81 | 584 | 0.5% | |
| 82 | 1568 | 1.3% | |
| 83 | 2562 | 2.2% | |
| 84 | 5636 | 4.8% |
| Value | Count | Frequency (%) | |
| 100 | 16 | < 0.1% | |
| 99 | 30 | < 0.1% | |
| 98 | 70 | 0.1% | |
| 97 | 217 | 0.2% | |
| 96 | 492 | 0.4% |
price
Real number (ℝ≥0)
| Distinct count | 375 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 8131 |
| Missing (%) | 6.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.93218212948297 |
|---|---|
| Minimum | 4.0 |
| Maximum | 3300.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 18 |
| median | 26 |
| Q3 | 44 |
| 95-th percentile | 85 |
| Maximum | 3300 |
| Range | 3296 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 41.17640796 |
|---|---|
| Coefficient of variation (CV) | 1.145947881 |
| Kurtosis | 891.4624011 |
| Mean | 35.93218213 |
| Median Absolute Deviation (MAD) | 20.06371919 |
| Skewness | 18.83952217 |
| Sum | 3924477 |
| Variance | 1695.496573 |
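The skewness (18.8) and kurtosis (891.5) above point to a long right tail: the maximum price is 3300 while Q3 is only 44. One common way to quantify this is Tukey's 1.5 × IQR rule. The report itself does not apply it, so the sketch below is an illustration under that assumption, using the quantiles and the five largest values listed in the report:

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) fences of the k * IQR rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Q1 and Q3 copied from the quantile table for price.
lower, upper = tukey_fences(18, 44)

# The five largest prices listed in the report.
top_prices = [3300, 2500, 2013, 2000, 1900]
flagged = [p for p in top_prices if p > upper]

print(f"fences: [{lower}, {upper}]")
print(f"flagged: {flagged}")
```

Because the lower fence is negative and the minimum price is 4, only the upper fence is informative for this column.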
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 20 | 6157 | 5.2% | |
| 25 | 5279 | 4.5% | |
| 15 | 5243 | 4.5% | |
| 30 | 4574 | 3.9% | |
| 18 | 4261 | 3.6% | |
| 40 | 3664 | 3.1% | |
| 35 | 3558 | 3.0% | |
| 12 | 3391 | 2.9% | |
| 50 | 3100 | 2.6% | |
| 16 | 3081 | 2.6% | |
| Other values (365) | 66911 | 57.0% | |
| (Missing) | 8131 | 6.9% |
| Value | Count | Frequency (%) | |
| 4 | 7 | < 0.1% | |
| 5 | 26 | < 0.1% | |
| 6 | 81 | 0.1% | |
| 7 | 298 | 0.3% | |
| 8 | 706 | 0.6% |
| Value | Count | Frequency (%) | |
| 3300 | 1 | < 0.1% | |
| 2500 | 2 | < 0.1% | |
| 2013 | 1 | < 0.1% | |
| 2000 | 2 | < 0.1% | |
| 1900 | 1 | < 0.1% |
province
Categorical
| Distinct count | 406 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 53 |
| Missing (%) | < 0.1% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| California | 33767 | 28.8% | |
| Washington | 8266 | 7.0% | |
| Bordeaux | 5635 | 4.8% | |
| Tuscany | 5553 | 4.7% | |
| Oregon | 4990 | 4.3% | |
| Burgundy | 3824 | 3.3% | |
| Northern Spain | 3605 | 3.1% | |
| Piedmont | 3598 | 3.1% | |
| Mendoza Province | 3099 | 2.6% | |
| New York | 2386 | 2.0% | |
| Other values (396) | 42574 | 36.3% |
Length
| Max length | 31 |
|---|---|
| Mean length | 10.00216447 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 41 | 56.2% | |
| Uppercase_Letter | 28 | 38.4% | |
| Other_Punctuation | 2 | 2.7% | |
| Dash_Punctuation | 1 | 1.4% | |
| Space_Separator | 1 | 1.4% |
| Value | Count | Frequency (%) | |
| Latin | 69 | 94.5% | |
| Common | 4 | 5.5% |
| Value | Count | Frequency (%) | |
| ASCII | 55 | 100.0% |
region_1
Categorical
| Distinct count | 1190 |
|---|---|
| Unique (%) | 1.2% |
| Missing | 19027 |
| Missing (%) | 16.2% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| Napa Valley | 4139 | 3.5% | |
| Columbia Valley (WA) | 3929 | 3.3% | |
| Russian River Valley | 2967 | 2.5% | |
| Paso Robles | 2242 | 1.9% | |
| Mendoza | 2190 | 1.9% | |
| California | 2149 | 1.8% | |
| Willamette Valley | 2119 | 1.8% | |
| Alsace | 2028 | 1.7% | |
| Barolo | 1566 | 1.3% | |
| Sonoma Coast | 1453 | 1.2% | |
| Other values (1180) | 73541 | 62.7% | |
| (Missing) | 19027 | 16.2% |
Length
| Max length | 50 |
|---|---|
| Mean length | 11.83266297 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 40 | 53.3% | |
| Uppercase_Letter | 26 | 34.7% | |
| Other_Punctuation | 4 | 5.3% | |
| Final_Punctuation | 1 | 1.3% | |
| Space_Separator | 1 | 1.3% | |
| Open_Punctuation | 1 | 1.3% | |
| Close_Punctuation | 1 | 1.3% | |
| Dash_Punctuation | 1 | 1.3% |
| Value | Count | Frequency (%) | |
| Latin | 66 | 88.0% | |
| Common | 9 | 12.0% |
| Value | Count | Frequency (%) | |
| ASCII | 59 | 98.3% | |
| Punctuation | 1 | 1.7% |
region_2
Categorical
| Distinct count | 17 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 70191 |
| Missing (%) | 59.8% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| Central Coast | 10552 | 9.0% | |
| Sonoma | 8525 | 7.3% | |
| Columbia Valley | 7773 | 6.6% | |
| Napa | 6395 | 5.4% | |
| Willamette Valley | 3211 | 2.7% | |
| California Other | 2181 | 1.9% | |
| Finger Lakes | 1609 | 1.4% | |
| Sierra Foothills | 1349 | 1.1% | |
| Napa-Sonoma | 1081 | 0.9% | |
| Central Valley | 989 | 0.8% | |
| Other values (7) | 3494 | 3.0% | |
| (Missing) | 70191 | 59.8% |
Length
| Max length | 17 |
|---|---|
| Mean length | 6.327822752 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 20 | 62.5% | |
| Uppercase_Letter | 10 | 31.2% | |
| Dash_Punctuation | 1 | 3.1% | |
| Space_Separator | 1 | 3.1% |
| Value | Count | Frequency (%) | |
| Latin | 30 | 93.8% | |
| Common | 2 | 6.2% |
| Value | Count | Frequency (%) | |
| ASCII | 32 | 100.0% |
taster_name
Categorical
| Distinct count | 19 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 23022 |
| Missing (%) | 19.6% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| Roger Voss | 22111 | 18.8% | |
| Michael Schachner | 13565 | 11.6% | |
| Kerin O’Keefe | 10099 | 8.6% | |
| Virginie Boone | 9222 | 7.9% | |
| Paul Gregutt | 8912 | 7.6% | |
| Matt Kettmann | 6057 | 5.2% | |
| Sean P. Sullivan | 4751 | 4.0% | |
| Joe Czerwinski | 4429 | 3.8% | |
| Anna Lee C. Iijima | 4128 | 3.5% | |
| Jim Gordon | 3781 | 3.2% | |
| Other values (9) | 7273 | 6.2% | |
| (Missing) | 23022 | 19.6% |
Length
| Max length | 18 |
|---|---|
| Mean length | 11.2847124 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 24 | 53.3% | |
| Uppercase_Letter | 17 | 37.8% | |
| Space_Separator | 2 | 4.4% | |
| Final_Punctuation | 1 | 2.2% | |
| Other_Punctuation | 1 | 2.2% |
| Value | Count | Frequency (%) | |
| Latin | 41 | 91.1% | |
| Common | 4 | 8.9% |
| Value | Count | Frequency (%) | |
| ASCII | 43 | 97.7% | |
| Punctuation | 1 | 2.3% |
taster_twitter_handle
Categorical
| Distinct count | 15 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 27658 |
| Missing (%) | 23.6% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| @vossroger | 22111 | 18.8% | |
| @wineschach | 13565 | 11.6% | |
| @kerinokeefe | 10099 | 8.6% | |
| @vboone | 9222 | 7.9% | |
| @paulgwine | 8912 | 7.6% | |
| @mattkettmann | 6057 | 5.2% | |
| @wawinereport | 4751 | 4.0% | |
| @JoeCz | 4429 | 3.8% | |
| @gordone_cellars | 3781 | 3.2% | |
| @AnneInVino | 3127 | 2.7% | |
| Other values (5) | 3638 | 3.1% | |
| (Missing) | 27658 | 23.6% |
Length
| Max length | 16 |
|---|---|
| Mean length | 8.866723477 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 23 | 74.2% | |
| Uppercase_Letter | 5 | 16.1% | |
| Connector_Punctuation | 1 | 3.2% | |
| Space_Separator | 1 | 3.2% | |
| Other_Punctuation | 1 | 3.2% |
| Value | Count | Frequency (%) | |
| Latin | 28 | 90.3% | |
| Common | 3 | 9.7% |
| Value | Count | Frequency (%) | |
| ASCII | 30 | 100.0% |
title
Categorical
| Distinct count | 73194 |
|---|---|
| Unique (%) | 62.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| Château Malartic-Lagravière Pessac-Léognan | 15 | < 0.1% | |
| Château Pape Clément Pessac-Léognan | 15 | < 0.1% | |
| Château Couhins-Lurton Pessac-Léognan | 15 | < 0.1% | |
| Château Smith Haut Lafitte Pessac-Léognan | 14 | < 0.1% | |
| Domaine de Chevalier Pessac-Léognan | 14 | < 0.1% | |
| Château Bouscaut Pessac-Léognan | 13 | < 0.1% | |
| Domäne Wachau Terrassen Federspiel Grüner Veltliner (Wachau) | 12 | < 0.1% | |
| Château Olivier Pessac-Léognan | 12 | < 0.1% | |
| Dutton-Goldfield Dutton Ranch Pinot Noir (Russian River Valley) | 11 | < 0.1% | |
| Château d'Esclans Les Clans Rosé (Côtes de Provence) | 11 | < 0.1% | |
| Other values (73184) | 117218 | 99.9% |
Length
| Max length | 132 |
|---|---|
| Mean length | 48.71018321 |
| Min length | 8 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 70 | 43.5% | |
| Uppercase_Letter | 45 | 28.0% | |
| Other_Punctuation | 15 | 9.3% | |
| Decimal_Number | 10 | 6.2% | |
| Final_Punctuation | 2 | 1.2% | |
| Control | 2 | 1.2% | |
| Other_Symbol | 2 | 1.2% | |
| Close_Punctuation | 2 | 1.2% | |
| Math_Symbol | 2 | 1.2% | |
| Space_Separator | 2 | 1.2% | |
| Other values (6) | 9 | 5.6% |
| Value | Count | Frequency (%) | |
| Latin | 115 | 71.4% | |
| Common | 46 | 28.6% |
| Value | Count | Frequency (%) | |
| ASCII | 86 | 92.5% | |
| Punctuation | 6 | 6.5% | |
| Geometric Shapes | 1 | 1.1% |
variety
Categorical
| Distinct count | 674 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| Pinot Noir | 12703 | 10.8% | |
| Chardonnay | 10687 | 9.1% | |
| Cabernet Sauvignon | 8981 | 7.7% | |
| Red Blend | 8420 | 7.2% | |
| Bordeaux-style Red Blend | 6729 | 5.7% | |
| Riesling | 4822 | 4.1% | |
| Sauvignon Blanc | 4297 | 3.7% | |
| Syrah | 4029 | 3.4% | |
| Merlot | 2857 | 2.4% | |
| Nebbiolo | 2753 | 2.3% | |
| Other values (664) | 51072 | 43.5% |
Length
| Max length | 35 |
|---|---|
| Mean length | 11.70023008 |
| Min length | 4 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 48 | 60.0% | |
| Uppercase_Letter | 27 | 33.8% | |
| Other_Punctuation | 3 | 3.8% | |
| Dash_Punctuation | 1 | 1.2% | |
| Space_Separator | 1 | 1.2% |
| Value | Count | Frequency (%) | |
| Latin | 75 | 93.8% | |
| Common | 5 | 6.2% |
| Value | Count | Frequency (%) | |
| ASCII | 56 | 100.0% |
winery
Categorical
| Distinct count | 15595 |
|---|---|
| Unique (%) | 13.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| Testarossa | 209 | 0.2% | |
| Williams Selyem | 206 | 0.2% | |
| Wines & Winemakers | 195 | 0.2% | |
| Georges Duboeuf | 195 | 0.2% | |
| DFJ Vinhos | 192 | 0.2% | |
| Chateau Ste. Michelle | 186 | 0.2% | |
| Louis Latour | 180 | 0.2% | |
| Columbia Crest | 152 | 0.1% | |
| Concha y Toro | 144 | 0.1% | |
| Gary Farrell | 124 | 0.1% | |
| Other values (15585) | 115567 | 98.5% |
Length
| Max length | 54 |
|---|---|
| Mean length | 12.32565829 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 59 | 48.0% | |
| Uppercase_Letter | 37 | 30.1% | |
| Decimal_Number | 10 | 8.1% | |
| Other_Punctuation | 9 | 7.3% | |
| Space_Separator | 2 | 1.6% | |
| Math_Symbol | 2 | 1.6% | |
| Open_Punctuation | 1 | 0.8% | |
| Other_Symbol | 1 | 0.8% | |
| Close_Punctuation | 1 | 0.8% | |
| Dash_Punctuation | 1 | 0.8% |
| Value | Count | Frequency (%) | |
| Latin | 96 | 78.0% | |
| Common | 27 | 22.0% |
| Value | Count | Frequency (%) | |
| ASCII | 76 | 98.7% | |
| Punctuation | 1 | 1.3% |
harvest
Real number (ℝ≥0)
| Distinct count | 12 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2010.9067234767788 |
|---|---|
| Minimum | 2004 |
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 2004 |
|---|---|
| 5-th percentile | 2006 |
| Q1 | 2009 |
| median | 2011 |
| Q3 | 2013 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 11 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.876632965 |
|---|---|
| Coefficient of variation (CV) | 0.001430515365 |
| Kurtosis | -0.6787605214 |
| Mean | 2010.906723 |
| Median Absolute Deviation (MAD) | 2.399656857 |
| Skewness | -0.4918496 |
| Sum | 235979904 |
| Variance | 8.275017215 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2004. 2005.5 2006.5 2007.5 2008.5 2009.5 2011.5 2014.5 2015. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 2013 | 15853 | 13.5% | |
| 2012 | 15723 | 13.4% | |
| 2014 | 15568 | 13.3% | |
| 2011 | 12531 | 10.7% | |
| 2010 | 12149 | 10.4% | |
| 2015 | 10041 | 8.6% | |
| 2009 | 9864 | 8.4% | |
| 2008 | 7426 | 6.3% | |
| 2007 | 7041 | 6.0% | |
| 2006 | 5772 | 4.9% | |
| Other values (2) | 5382 | 4.6% |
| Value | Count | Frequency (%) | |
| 2004 | 1772 | 1.5% | |
| 2005 | 3610 | 3.1% | |
| 2006 | 5772 | 4.9% | |
| 2007 | 7041 | 6.0% | |
| 2008 | 7426 | 6.3% |
| Value | Count | Frequency (%) | |
| 2015 | 10041 | 8.6% | |
| 2014 | 15568 | 13.3% | |
| 2013 | 15853 | 13.5% | |
| 2012 | 15723 | 13.4% | |
| 2011 | 12531 | 10.7% |
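Because harvest has only 12 distinct values, the frequency tables above give its complete distribution, and the summary statistics can be recomputed from the counts alone. A minimal sketch; the quantile helper is a hypothetical no-interpolation rule chosen for illustration, not necessarily the method the report uses:

```python
# Year -> count, copied from the frequency tables for "harvest".
counts = {
    2004: 1772, 2005: 3610, 2006: 5772, 2007: 7041,
    2008: 7426, 2009: 9864, 2010: 12149, 2011: 12531,
    2012: 15723, 2013: 15853, 2014: 15568, 2015: 10041,
}

n = sum(counts.values())
total = sum(year * count for year, count in counts.items())
mean = total / n

def quantile_from_counts(counts, q):
    """Smallest value whose cumulative count reaches q * n (no interpolation)."""
    target = q * sum(counts.values())
    running = 0
    for value in sorted(counts):
        running += counts[value]
        if running >= target:
            return value
    return max(counts)

print(f"n={n} sum={total} mean={mean:.4f}")
print("Q1, median, Q3:", [quantile_from_counts(counts, q) for q in (0.25, 0.5, 0.75)])
```

The recomputed count, sum, mean and quartiles agree with the summary tables, which confirms that the frequency tables account for every observation.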
description_desc
Categorical
| Distinct count | 108109 |
|---|---|
| Unique (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| blended viognier grenache fragrant offering display aroma fresh flower herb green olive huckleberry smoked meat palate ripe fruit forward bit balled middle right showing pleasing sense purity length match give time come | 3 | < 0.1% | |
| zesty red pretty aroma suggest small red berry blue flower whiff moist soil vibrant palate offer sour cherry pomegranate hint anise alongside zesty acidity refined tannin | 3 | < 0.1% | |
| cigar box caf lait dried tobacco aroma followed coffee cherry flavor barrel spice lingering finish wood get bit front still delivers enjoyment | 3 | < 0.1% | |
| gravenstein apple honeysuckle jasmine aroma show relatively boisterous nose bottling large vineyard highway east paso roble compellingly grippy texture sip ripe flavor pear honeydew melon salty acidity take next level | 3 | < 0.1% | |
| seductively tart lemon pith cranberry pomegranate refreshing light bodied quaff infinitely enjoyable table continues expand palate increasing array fresh flavor finishing cherry orange | 3 | < 0.1% | |
| ripe plum game truffle leather menthol aroma find earthy wine tightly wound palate offer dried black cherry chopped sage mint roasted coffee bean alongside raspy tannin leave mouth drying finish | 3 | < 0.1% | |
| one cellar vibrantly rich forward blackberry black cherry plum dark chocolate sweet smoky new oak yet tannic creates astringency last spicy finish would pity open last year | 2 | < 0.1% | |
| orange mandarin flavor mark warm full bodied wine forward fruity showing sweeter character vanilla spice tone drink soon | 2 | < 0.1% | |
| come point pursuit restraint becomes fruitless wine show wonderfully silky texture elegant finish subtle aroma flavor roasted cashew citrus indeed understated | 2 | < 0.1% | |
| blend roussanne grenache blanc offer broad aroma cooked apple honeydew melon peach custard palate tighter seared green apple lemon rind yellow grapefruit zest slowly open tropically banana element full bodied white wine | 2 | < 0.1% | |
| Other values (108099) | 117324 | > 99.9% |
Length
| Max length | 594 |
|---|---|
| Mean length | 171.5826331 |
| Min length | 16 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 26 | 96.3% | |
| Space_Separator | 1 | 3.7% |
| Value | Count | Frequency (%) | |
| Latin | 26 | 96.3% | |
| Common | 1 | 3.7% |
| Value | Count | Frequency (%) | |
| ASCII | 27 | 100.0% |
clusters
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Value | Count | Frequency (%) | |
| 1 | 43000 | 36.6% | |
| 2 | 30523 | 26.0% | |
| 0 | 23013 | 19.6% | |
| 3 | 20814 | 17.7% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
PCA0_target
Real number (ℝ≥0)
| Distinct count | 108099 |
|---|---|
| Unique (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.232140867080342 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.4482370130352956 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1520708595 |
| Q1 | 0.1979928106 |
| median | 0.2314538989 |
| Q3 | 0.2660606898 |
| 95-th percentile | 0.3141067791 |
| Maximum | 0.448237013 |
| Range | 0.448237013 |
| Interquartile range (IQR) | 0.06806787922 |
Descriptive statistics
| Standard deviation | 0.04934924386 |
|---|---|
| Coefficient of variation (CV) | 0.2125831805 |
| Kurtosis | -0.1188513719 |
| Mean | 0.2321408671 |
| Median Absolute Deviation (MAD) | 0.03966507301 |
| Skewness | 0.0370530663 |
| Sum | 27241.73075 |
| Variance | 0.002435347869 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.06410098 0.08431426 0.09782379 0.10766119 ... 0.36275448 0.37383079 0.3856988 0.40749916 0.44823701], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 0.2096395862 | 3 | < 0.1% | |
| 0.2101562857 | 3 | < 0.1% | |
| 0.2149799543 | 3 | < 0.1% | |
| 0.2034564855 | 3 | < 0.1% | |
| 0.2201976122 | 3 | < 0.1% | |
| 0.199088648 | 3 | < 0.1% | |
| 0.2693375665 | 3 | < 0.1% | |
| 0.1959518392 | 2 | < 0.1% | |
| 0.248919225 | 2 | < 0.1% | |
| Other values (108089) | 117321 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 0.02622330587 | 1 | < 0.1% | |
| 0.0359165718 | 1 | < 0.1% | |
| 0.03592702426 | 1 | < 0.1% | |
| 0.03629950478 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.448237013 | 1 | < 0.1% | |
| 0.4352325328 | 1 | < 0.1% | |
| 0.4326741475 | 1 | < 0.1% | |
| 0.4199883241 | 1 | < 0.1% | |
| 0.4198175477 | 1 | < 0.1% |
PCA1_target
Real number (ℝ)
| Distinct count | 108099 |
|---|---|
| Unique (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.006193907760403038 |
|---|---|
| Minimum | -0.4027119778301928 |
| Maximum | 0.4137258970928985 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | -0.4027119778 |
|---|---|
| 5-th percentile | -0.2127626816 |
| Q1 | -0.1059116603 |
| median | 0.01225191505 |
| Q3 | 0.0839842402 |
| 95-th percentile | 0.1811133257 |
| Maximum | 0.4137258971 |
| Range | 0.8164378749 |
| Interquartile range (IQR) | 0.1898959005 |
Descriptive statistics
| Standard deviation | 0.1229796658 |
|---|---|
| Coefficient of variation (CV) | -19.85493982 |
| Kurtosis | -0.6508586456 |
| Mean | -0.00619390776 |
| Median Absolute Deviation (MAD) | 0.1026372967 |
| Skewness | -0.1816010743 |
| Sum | -726.8550757 |
| Variance | 0.01512399821 |
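The coefficient of variation reported above (about -19.85) looks alarming, but it mostly reflects a mean close to zero: CV is the standard deviation divided by the mean, so it grows without bound and takes the mean's sign as the mean approaches zero. A short sketch comparing the two PCA columns, with figures copied from the report and an illustrative helper name:

```python
def coefficient_of_variation(std, mean):
    """Standard deviation relative to the mean; undefined for mean == 0."""
    return std / mean

# (standard deviation, mean) copied from the descriptive statistics tables.
cv_pca0 = coefficient_of_variation(0.04934924386, 0.2321408671)
cv_pca1 = coefficient_of_variation(0.1229796658, -0.00619390776)

print(f"PCA0_target CV: {cv_pca0:.4f}")
print(f"PCA1_target CV: {cv_pca1:.4f}")
```

For roughly zero-centred columns such as PCA1_target, the standard deviation or MAD is a more stable measure of spread than CV.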
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.40271198 -0.36548486 -0.3330575 -0.31414813 -0.29076813 ... 0.27799744 0.30596313 0.33703035 0.35862729 0.4137259 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| -0.1556769728 | 3 | < 0.1% | |
| -0.05060963262 | 3 | < 0.1% | |
| 0.09438424415 | 3 | < 0.1% | |
| 0.2056511378 | 3 | < 0.1% | |
| -0.1141648182 | 3 | < 0.1% | |
| -0.03109116656 | 3 | < 0.1% | |
| 0.07060915926 | 3 | < 0.1% | |
| 0.1411865741 | 2 | < 0.1% | |
| 0.05966207114 | 2 | < 0.1% | |
| Other values (108089) | 117321 | > 99.9% |
| Value | Count | Frequency (%) | |
| -0.4027119778 | 1 | < 0.1% | |
| -0.4026497397 | 1 | < 0.1% | |
| -0.3858573992 | 1 | < 0.1% | |
| -0.3804127197 | 1 | < 0.1% | |
| -0.3799128848 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.4137258971 | 1 | < 0.1% | |
| 0.3720501363 | 1 | < 0.1% | |
| 0.3653804856 | 2 | < 0.1% | |
| 0.3598507684 | 1 | < 0.1% | |
| 0.3574038022 | 1 | < 0.1% |
aspects
Categorical
| Distinct count | 106339 |
|---|---|
| Unique (%) | 90.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| [] | 83 |
|---|---|
| ['wine'] | 41 |
| ['flavor'] | 38 |
| ['acidity'] | 32 |
| ['finish', 'flavor'] | 23 |
| Other values (106334) | 117133 |
| Value | Count | Frequency (%) | |
| [] | 83 | 0.1% | |
| ['wine'] | 41 | < 0.1% | |
| ['flavor'] | 38 | < 0.1% | |
| ['acidity'] | 32 | < 0.1% | |
| ['finish', 'flavor'] | 23 | < 0.1% | |
| ['wine', 'flavor'] | 19 | < 0.1% | |
| ['finish'] | 18 | < 0.1% | |
| ['palate', 'acidity'] | 17 | < 0.1% | |
| ['wine', 'fruit'] | 17 | < 0.1% | |
| ['flavor', 'fruit'] | 16 | < 0.1% | |
| Other values (106329) | 117046 | 99.7% |
Length
| Max length | 71 |
|---|---|
| Mean length | 47.44003409 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 26 | 83.9% | |
| Other_Punctuation | 2 | 6.5% | |
| Close_Punctuation | 1 | 3.2% | |
| Space_Separator | 1 | 3.2% | |
| Open_Punctuation | 1 | 3.2% |
| Value | Count | Frequency (%) | |
| Latin | 26 | 83.9% | |
| Common | 5 | 16.1% |
| Value | Count | Frequency (%) | |
| ASCII | 31 | 100.0% |
| Distinct count | 5419 |
|---|---|
| Unique (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.508675841499787 |
|---|---|
| Minimum | -0.9288 |
| Maximum | 0.9937 |
| Zeros | 12056 |
| Zeros (%) | 10.3% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | -0.9288 |
|---|---|
| 5-th percentile | -0.1779 |
| Q1 | 0.2732 |
| median | 0.6191 |
| Q3 | 0.8145 |
| 95-th percentile | 0.9287 |
| Maximum | 0.9937 |
| Range | 1.9225 |
| Interquartile range (IQR) | 0.5413 |
Descriptive statistics
| Standard deviation | 0.3724939936 |
|---|---|
| Coefficient of variation (CV) | 0.7322816678 |
| Kurtosis | 0.02325142567 |
| Mean | 0.5086758415 |
| Median Absolute Deviation (MAD) | 0.3077730608 |
| Skewness | -0.900466504 |
| Sum | 59693.11 |
| Variance | 0.1387517753 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.9288 -0.8922 -0.8241 -0.73525 -0.73485 ... 0.97125 0.97765 0.984 0.98895 0.9937 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 12056 | 10.3% | |
| 0.4404 | 2907 | 2.5% | |
| 0.3182 | 1909 | 1.6% | |
| 0.4588 | 1750 | 1.5% | |
| 0.5574 | 1730 | 1.5% | |
| 0.6369 | 1456 | 1.2% | |
| 0.4939 | 1450 | 1.2% | |
| 0.5106 | 1325 | 1.1% | |
| 0.4215 | 1223 | 1.0% | |
| 0.7003 | 1187 | 1.0% | |
| Other values (5409) | 90357 | 77.0% |
| Value | Count | Frequency (%) | |
| -0.9288 | 1 | < 0.1% | |
| -0.9153 | 1 | < 0.1% | |
| -0.8934 | 1 | < 0.1% | |
| -0.891 | 1 | < 0.1% | |
| -0.89 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.9937 | 1 | < 0.1% | |
| 0.9911 | 1 | < 0.1% | |
| 0.991 | 2 | < 0.1% | |
| 0.9907 | 1 | < 0.1% | |
| 0.9896 | 1 | < 0.1% |
| Distinct count | 517 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8504438346825732 |
|---|---|
| Minimum | 0.381 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0.381 |
|---|---|
| 5-th percentile | 0.688 |
| Q1 | 0.789 |
| median | 0.856 |
| Q3 | 0.919 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 0.619 |
| Interquartile range (IQR) | 0.13 |
Descriptive statistics
| Standard deviation | 0.093378599 |
|---|---|
| Coefficient of variation (CV) | 0.1097998424 |
| Kurtosis | -0.1208399178 |
| Mean | 0.8504438347 |
| Median Absolute Deviation (MAD) | 0.07553674816 |
| Skewness | -0.3851154773 |
| Sum | 99799.584 |
| Variance | 0.008719562751 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.381 0.4745 0.5235 0.5555 0.5785 ... 0.9775 0.9805 0.9835 0.992 1. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1 | 11779 | 10.0% | |
| 0.909 | 989 | 0.8% | |
| 0.833 | 823 | 0.7% | |
| 0.87 | 642 | 0.5% | |
| 0.912 | 627 | 0.5% | |
| 0.906 | 604 | 0.5% | |
| 0.882 | 578 | 0.5% | |
| 0.896 | 575 | 0.5% | |
| 0.864 | 572 | 0.5% | |
| 0.938 | 556 | 0.5% | |
| Other values (507) | 99605 | 84.9% |
| Value | Count | Frequency (%) | |
| 0.381 | 1 | < 0.1% | |
| 0.386 | 1 | < 0.1% | |
| 0.394 | 1 | < 0.1% | |
| 0.395 | 1 | < 0.1% | |
| 0.409 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 11779 | 10.0% | |
| 0.984 | 2 | < 0.1% | |
| 0.983 | 2 | < 0.1% | |
| 0.982 | 7 | < 0.1% | |
| 0.981 | 4 | < 0.1% |
| Distinct count | 297 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.015264772049424799 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.509 |
| Zeros | 88537 |
| Zeros (%) | 75.4% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.081 |
| Maximum | 0.509 |
| Range | 0.509 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.0322199173 |
|---|---|
| Coefficient of variation (CV) | 2.110736878 |
| Kurtosis | 11.38529399 |
| Mean | 0.01526477205 |
| Median Absolute Deviation (MAD) | 0.02303492133 |
| Skewness | 2.825806854 |
| Sum | 1791.321 |
| Variance | 0.001038123071 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.0055 0.0115 0.0135 0.0165 ... 0.2055 0.2285 0.2845 0.3435 0.509 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 88537 | 75.4% | |
| 0.036 | 531 | 0.5% | |
| 0.043 | 524 | 0.4% | |
| 0.034 | 517 | 0.4% | |
| 0.046 | 513 | 0.4% | |
| 0.045 | 511 | 0.4% | |
| 0.038 | 495 | 0.4% | |
| 0.041 | 481 | 0.4% | |
| 0.047 | 475 | 0.4% | |
| 0.048 | 473 | 0.4% | |
| Other values (287) | 24293 | 20.7% |
| Value | Count | Frequency (%) | |
| 0 | 88537 | 75.4% | |
| 0.011 | 3 | < 0.1% | |
| 0.012 | 5 | < 0.1% | |
| 0.013 | 8 | < 0.1% | |
| 0.014 | 18 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.509 | 1 | < 0.1% | |
| 0.453 | 1 | < 0.1% | |
| 0.443 | 1 | < 0.1% | |
| 0.372 | 1 | < 0.1% | |
| 0.371 | 1 | < 0.1% |
| Distinct count | 510 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13429123135918195 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.619 |
| Zeros | 17207 |
| Zeros (%) | 14.7% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.066 |
| median | 0.127 |
| Q3 | 0.197 |
| 95-th percentile | 0.299 |
| Maximum | 0.619 |
| Range | 0.619 |
| Interquartile range (IQR) | 0.131 |
Descriptive statistics
| Standard deviation | 0.09361384992 |
|---|---|
| Coefficient of variation (CV) | 0.6970957744 |
| Kurtosis | -0.1359213442 |
| Mean | 0.1342912314 |
| Median Absolute Deviation (MAD) | 0.07616004881 |
| Skewness | 0.4654765968 |
| Sum | 15759.076 |
| Variance | 0.008763552897 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.007 0.0155 0.0195 0.0225 ... 0.4375 0.4555 0.49 0.5255 0.619 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 17207 | 14.7% | |
| 0.091 | 937 | 0.8% | |
| 0.167 | 693 | 0.6% | |
| 0.088 | 658 | 0.6% | |
| 0.094 | 593 | 0.5% | |
| 0.13 | 589 | 0.5% | |
| 0.086 | 562 | 0.5% | |
| 0.079 | 557 | 0.5% | |
| 0.081 | 554 | 0.5% | |
| 0.077 | 553 | 0.5% | |
| Other values (500) | 94447 | 80.5% |
| Value | Count | Frequency (%) | |
| 0 | 17207 | 14.7% | |
| 0.014 | 1 | < 0.1% | |
| 0.015 | 1 | < 0.1% | |
| 0.016 | 3 | < 0.1% | |
| 0.017 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.619 | 1 | < 0.1% | |
| 0.606 | 1 | < 0.1% | |
| 0.605 | 1 | < 0.1% | |
| 0.591 | 1 | < 0.1% | |
| 0.588 | 1 | < 0.1% |
sentiment
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| POSITIVE | 96550 |
|---|---|
| NEUTRAL | 12056 |
| NEGATIVE | 8744 |
| Value | Count | Frequency (%) | |
| POSITIVE | 96550 | 82.3% | |
| NEUTRAL | 12056 | 10.3% | |
| NEGATIVE | 8744 | 7.5% |
Length
| Max length | 8 |
|---|---|
| Mean length | 7.897264593 |
| Min length | 7 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 13 | 100.0% |
| Value | Count | Frequency (%) | |
| Latin | 13 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 13 | 100.0% |
rating_cat
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| 2 | 41518 |
|---|---|
| 3 | 39794 |
| 1 | 27689 |
| 4 | 5744 |
| 0 | 2489 |
| Value | Count | Frequency (%) | |
| 2 | 41518 | 35.4% | |
| 3 | 39794 | 33.9% | |
| 1 | 27689 | 23.6% | |
| 4 | 5744 | 4.9% | |
| 0 | 2489 | 2.1% | |
| 5 | 116 | 0.1% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 6 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 6 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 6 | 100.0% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
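As a minimal sketch of that calculation (not the code pandas-profiling itself runs), the function below divides the covariance by the product of the standard deviations using only the standard library. The name `pearson_r` is hypothetical, and the sample values are the `positive_score` and `neutral_score` entries of the first five rows shown under "First rows".

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of (cross-)deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


# positive_score and neutral_score for rows 0-4 of the "First rows" table
positive = [0.065, 0.132, 0.000, 0.074, 0.195]
neutral = [0.935, 0.868, 0.947, 0.926, 0.805]

r = pearson_r(positive, neutral)
print(round(r, 3))  # strongly negative on this small sample
```

On these five rows the result is close to -1, which is in line with the "High Correlation" warning the report raises for `positive_score` and `neutral_score`.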
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
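The rank-based calculation can be sketched as follows. This is an illustrative standard-library version, not the report's own implementation; the helper names `average_ranks` and `spearman_rho` are hypothetical, and tied values are assumed to share the average of their ranks.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    ss_x = sum((a - mean_x) ** 2 for a in rx)
    ss_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(ss_x * ss_y)


# A nonlinear but strictly increasing relationship has identical rank orderings.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```

Because only the ordering of the values enters the calculation, any strictly increasing transformation of either variable leaves ρ unchanged.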
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
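The pair-counting definition above can be sketched directly. Note the assumption: this is the simple variant without tie correction (often called tau-a), written here under the hypothetical name `kendall_tau_a`; library implementations commonly apply a tie-corrected variant, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau without tie correction: (concordant - discordant) / total pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1  # both variables order the pair the same way
        elif sign < 0:
            discordant += 1  # the variables order the pair in opposite ways
        # sign == 0 means a tie in x or y; it counts toward neither group
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs


# Three observations: two concordant pairs and one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

The loop visits every pair of observations, so this sketch is quadratic in the number of rows and is meant for illustration on small samples only.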
First rows
| | country | description | designation | points | price | province | region_1 | region_2 | taster_name | taster_twitter_handle | title | variety | winery | harvest | description_desc | clusters | PCA0_target | PCA1_target | aspects | polarity_score | neutral_score | negative_score | positive_score | sentiment | rating_cat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Italy | Aromas include tropical fruit, broom, brimstone and dried herb. The palate isn't overly expressive, offering unripened apple, citrus and dried sage alongside brisk acidity. | Vulkà Bianco | 87 | NaN | Sicily & Sardinia | Etna | NaN | Kerin O’Keefe | @kerinokeefe | Nicosia Vulkà Bianco (Etna) | White Blend | Nicosia | 2013 | aroma include tropical fruit broom brimstone dried herb palate overly expressive offering unripened apple citrus dried sage alongside brisk acidity | 2 | 0.202360 | -0.097448 | ['broom', 'acidity', 'herb', 'brimstone', 'apple'] | 0.1531 | 0.935 | 0.000 | 0.065 | POSITIVE | 2 |
| 1 | Portugal | This is ripe and fruity, a wine that is smooth while still structured. Firm tannins are filled out with juicy red berry fruits and freshened with acidity. It's already drinkable, although it will certainly be better from 2016. | Avidagos | 87 | 15.0 | Douro | NaN | NaN | Roger Voss | @vossroger | Quinta dos Avidagos Avidagos Red (Douro) | Portuguese Red | Quinta dos Avidagos | 2011 | ripe fruity wine smooth still structured firm tannin filled juicy red berry fruit freshened acidity already drinkable although certainly better | 3 | 0.274710 | 0.030943 | ['acidity', 'firm', 'fruity', 'wine', 'tannin'] | 0.6486 | 0.868 | 0.000 | 0.132 | POSITIVE | 2 |
| 2 | US | Tart and snappy, the flavors of lime flesh and rind dominate. Some green pineapple pokes through, with crisp acidity underscoring the flavors. The wine was all stainless-steel fermented. | NaN | 87 | 14.0 | Oregon | Willamette Valley | Willamette Valley | Paul Gregutt | @paulgwine | Rainstorm Pinot Gris (Willamette Valley) | Pinot Gris | Rainstorm | 2013 | tart snappy flavor lime flesh rind dominate green pineapple poke crisp acidity underscoring flavor wine stainless steel fermented | 2 | 0.189913 | -0.219316 | ['steel', 'acidity', 'wine', 'pineapple', 'flavor'] | -0.1280 | 0.947 | 0.053 | 0.000 | NEGATIVE | 2 |
| 3 | US | Pineapple rind, lemon pith and orange blossom start off the aromas. The palate is a bit more opulent, with notes of honey-drizzled guava and mango giving way to a slightly astringent, semidry finish. | Reserve Late Harvest | 87 | 13.0 | Michigan | Lake Michigan Shore | NaN | Alexander Peartree | NaN | St. Julian Reserve Late Harvest Riesling (Lake Michigan Shore) | Riesling | St. Julian | 2013 | pineapple rind lemon pith orange blossom start aroma palate bit opulent note honey drizzled guava mango giving way slightly astringent semidry finish | 2 | 0.157455 | -0.114367 | ['bit', 'palate', 'mango', 'finish', 'note'] | 0.3400 | 0.926 | 0.000 | 0.074 | POSITIVE | 2 |
| 4 | US | Much like the regular bottling from 2012, this comes across as rather rough and tannic, with rustic, earthy, herbal characteristics. Nonetheless, if you think of it as a pleasantly unfussy country wine, it's a good companion to a hearty winter stew. | Vintner's Reserve Wild Child Block | 87 | 65.0 | Oregon | Willamette Valley | Willamette Valley | Paul Gregutt | @paulgwine | Sweet Cheeks Vintner's Reserve Wild Child Block Pinot Noir (Willamette Valley) | Pinot Noir | Sweet Cheeks | 2012 | much like regular bottling come across rather rough tannic rustic earthy herbal characteristic nonetheless think pleasantly unfussy country wine good companion hearty winter stew | 1 | 0.137731 | 0.014431 | ['companion', 'country', 'bottling', 'wine'] | 0.8176 | 0.805 | 0.000 | 0.195 | POSITIVE | 2 |
| 5 | Spain | Blackberry and raspberry aromas show a typical Navarran whiff of green herbs and, in this case, horseradish. In the mouth, this is fairly full bodied, with tomatoey acidity. Spicy, herbal flavors complement dark plum fruit, while the finish is fresh but grabby. | Ars In Vitro | 87 | 15.0 | Northern Spain | Navarra | NaN | Michael Schachner | @wineschach | Tandem Ars In Vitro Tempranillo-Merlot (Navarra) | Tempranillo-Merlot | Tandem | 2011 | blackberry raspberry aroma show typical navarran whiff green herb case horseradish mouth fairly full bodied tomatoey acidity spicy herbal flavor complement dark plum fruit finish fresh grabby | 1 | 0.371027 | 0.035006 | ['case', 'acidity', 'tomatoey', 'flavor', 'mouth'] | 0.1655 | 0.960 | 0.000 | 0.040 | POSITIVE | 2 |
| 6 | Italy | Here's a bright, informal red that opens with aromas of candied berry, white pepper and savory herb that carry over to the palate. It's balanced with fresh acidity and soft tannins. | Belsito | 87 | 16.0 | Sicily & Sardinia | Vittoria | NaN | Kerin O’Keefe | @kerinokeefe | Terre di Giurfo Belsito Frappato (Vittoria) | Frappato | Terre di Giurfo | 2013 | bright informal red open aroma candied berry white pepper savory herb carry palate balanced fresh acidity soft tannin | 0 | 0.271530 | 0.000502 | ['aroma', 'palate', 'herb', 'tannin', 'acidity'] | 0.6369 | 0.843 | 0.000 | 0.157 | POSITIVE | 2 |
| 7 | France | This dry and restrained wine offers spice in profusion. Balanced with acidity and a firm texture, it's very much for food. | NaN | 87 | 24.0 | Alsace | Alsace | NaN | Roger Voss | @vossroger | Trimbach Gewurztraminer (Alsace) | Gewürztraminer | Trimbach | 2012 | dry restrained wine offer spice profusion balanced acidity firm texture much food | 3 | 0.204479 | -0.055572 | ['profusion', 'offer', 'acidity', 'wine', 'firm'] | 0.0000 | 1.000 | 0.000 | 0.000 | NEUTRAL | 2 |
| 8 | Germany | Savory dried thyme notes accent sunnier flavors of preserved peach in this brisk, off-dry wine. It's fruity and fresh, with an elegant, sprightly footprint. | Shine | 87 | 12.0 | Rheinhessen | NaN | NaN | Anna Lee C. Iijima | NaN | Heinz Eifel Shine Gewürztraminer (Rheinhessen) | Gewürztraminer | Heinz Eifel | 2013 | savory dried thyme note accent sunnier flavor preserved peach brisk dry wine fruity fresh elegant sprightly footprint | 2 | 0.215866 | -0.118174 | ['wine', 'sunnier', 'accent', 'thyme', 'fruity'] | 0.9091 | 0.586 | 0.000 | 0.414 | POSITIVE | 2 |
| 9 | France | This has great depth of flavor with its fresh apple and pear fruits and touch of spice. It's off dry while balanced with acidity and a crisp texture. Drink now. | Les Natures | 87 | 27.0 | Alsace | Alsace | NaN | Roger Voss | @vossroger | Jean-Baptiste Adam Les Natures Pinot Gris (Alsace) | Pinot Gris | Jean-Baptiste Adam | 2012 | great depth flavor fresh apple pear fruit touch spice dry balanced acidity crisp texture drink | 2 | 0.305882 | -0.281712 | ['texture', 'acidity', 'depth', 'pear', 'apple'] | 0.7506 | 0.808 | 0.000 | 0.192 | POSITIVE | 2 |
Last rows
| | country | description | designation | points | price | province | region_1 | region_2 | taster_name | taster_twitter_handle | title | variety | winery | harvest | description_desc | clusters | PCA0_target | PCA1_target | aspects | polarity_score | neutral_score | negative_score | positive_score | sentiment | rating_cat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 117340 | Italy | Intense aromas of wild cherry, baking spice, tilled soil and savory herb lead the nose on this soulful, silky red. The round, smooth palate doles out juicy red cherry, strawberry jelly, mineral, white pepper and an intriguing note of zabaglione alongside soft, supple tannins and bright acidity.. | NaN | 90 | 30.0 | Sicily & Sardinia | Sicilia | NaN | Kerin O’Keefe | @kerinokeefe | COS Frappato (Sicilia) | Frappato | COS | 2013 | intense aroma wild cherry baking spice tilled soil savory herb lead nose soulful silky red round smooth palate dole juicy red cherry strawberry jelly mineral white pepper intriguing note zabaglione alongside soft supple tannin bright acidity | 0 | 0.322159 | 0.093441 | ['acidity', 'lead', 'herb', 'dole', 'spice'] | 0.4939 | 0.915 | 0.000 | 0.085 | POSITIVE | 3 |
| 117341 | Italy | Blackberry, cassis, grilled herb and toasted aromas come together in the glass. On the palate, espresso, mint and black pepper add depth to the core of black cherry and blackberry flavors. It finishes on a licorice note. | Sàgana Tenuta San Giacomo | 90 | 40.0 | Sicily & Sardinia | Sicilia | NaN | Kerin O’Keefe | @kerinokeefe | Cusumano Sàgana Tenuta San Giacomo Nero d'Avola (Sicilia) | Nero d'Avola | Cusumano | 2012 | blackberry cassis grilled herb toasted aroma come together glass palate espresso mint black pepper add depth core black cherry blackberry flavor finish licorice note | 0 | 0.275980 | 0.240302 | ['herb', 'flavor', 'note', 'pepper', 'palate'] | 0.0000 | 1.000 | 0.000 | 0.000 | NEUTRAL | 3 |
| 117342 | Israel | A bouquet of black cherry, tart cranberry and clove opens into flavors of cherry, anisette, espresso bean and mint, with a hint of tart cranberry. The minty notes can almost seem overly strong for a moment, but tart tones bring the fruit flavors back to the foreground. The pleasantly gripping tannins will mellow with a few more years of aging. | Oak Aged | 90 | 20.0 | Galilee | NaN | NaN | Mike DeSimone | @worldwineguys | Dalton Oak Aged Cabernet Sauvignon (Galilee) | Cabernet Sauvignon | Dalton | 2012 | bouquet black cherry tart cranberry clove open flavor cherry anisette espresso bean mint hint tart cranberry minty note almost seem overly strong moment tart tone bring fruit flavor back foreground pleasantly gripping tannin mellow year aging | 0 | 0.243631 | 0.120869 | ['cherry', 'flavor', 'tone', 'tannin', 'cranberry'] | 0.7326 | 0.897 | 0.000 | 0.103 | POSITIVE | 3 |
| 117343 | France | Initially quite muted, this wine slowly develops impressive richness and spice. It's not sweet, more medium dry, with the spice forming a core of dryness that contrasts with the honeyed texture. It can develop more, so wait to drink until 2016. | Domaine Saint-Rémy Herrenweg | 90 | NaN | Alsace | Alsace | NaN | Roger Voss | @vossroger | Domaine Ehrhart Domaine Saint-Rémy Herrenweg Gewurztraminer (Alsace) | Gewürztraminer | Domaine Ehrhart | 2013 | initially quite muted wine slowly develops impressive richness spice sweet medium dry spice forming core dryness contrast honeyed texture develop wait drink | 3 | 0.205465 | -0.033702 | ['spice', 'texture', 'wine', 'wait', 'contrast'] | 0.6149 | 0.805 | 0.054 | 0.141 | POSITIVE | 3 |
| 117344 | France | While it's rich, this beautiful dry wine also offers considerable freshness. Acidity cuts easily through the ripe white fruit, pear and red apples, allowing room for spice that provides a contrasting aftertaste. | Seppi Landmann Vallée Noble | 90 | 28.0 | Alsace | Alsace | NaN | Roger Voss | @vossroger | Domaine Rieflé-Landmann Seppi Landmann Vallée Noble Pinot Gris (Alsace) | Pinot Gris | Domaine Rieflé-Landmann | 2013 | rich beautiful dry wine also offer considerable freshness acidity cut easily ripe white fruit pear red apple allowing room spice provides contrasting aftertaste | 2 | 0.307170 | -0.203583 | ['acidity', 'wine', 'freshness', 'fruit', 'spice'] | 0.8564 | 0.678 | 0.055 | 0.267 | POSITIVE | 3 |
| 117345 | Germany | Notes of honeysuckle and cantaloupe sweeten this deliciously feather-light spätlese. It's intensely juicy, quenching the palate with streams of tart tangerine and grapefruit acidity, yet wraps up with a kiss of honey and peach. | Brauneberger Juffer-Sonnenuhr Spätlese | 90 | 28.0 | Mosel | NaN | NaN | Anna Lee C. Iijima | NaN | Dr. H. Thanisch (Erben Müller-Burggraef) Brauneberger Juffer-Sonnenuhr Spätlese Riesling (Mosel) | Riesling | Dr. H. Thanisch (Erben Müller-Burggraef) | 2013 | note honeysuckle cantaloupe sweeten deliciously feather light tlese intensely juicy quenching palate stream tart tangerine grapefruit acidity yet wrap kiss honey peach | 2 | 0.172763 | -0.179155 | ['stream', 'palate', 'acidity', 'wrap', 'peach'] | 0.7331 | 0.834 | 0.000 | 0.166 | POSITIVE | 3 |
| 117346 | US | Citation is given as much as a decade of bottle age prior to release, which means it is pre-cellared and drinking at its peak. Baked cherry, cocoa and coconut flavors combine gracefully, with soft, secondary fruit compote highlights. | NaN | 90 | 75.0 | Oregon | Oregon | Oregon Other | Paul Gregutt | @paulgwine | Citation Pinot Noir (Oregon) | Pinot Noir | Citation | 2004 | citation given much decade bottle age prior release mean pre cellared drinking peak baked cherry cocoa coconut flavor combine gracefully soft secondary fruit compote highlight | 1 | 0.147380 | 0.033800 | ['drinking', 'release', 'pre', 'decade', 'mean'] | 0.5267 | 0.914 | 0.000 | 0.086 | POSITIVE | 3 |
| 117347 | France | Well-drained gravel soil gives this wine its crisp and dry character. It is ripe and fruity, although the spice is subdued in favor of a more serious structure. This is a wine to age for a couple of years, so drink from 2017. | Kritt | 90 | 30.0 | Alsace | Alsace | NaN | Roger Voss | @vossroger | Domaine Gresser Kritt Gewurztraminer (Alsace) | Gewürztraminer | Domaine Gresser | 2013 | well drained gravel soil give wine crisp dry character ripe fruity although spice subdued favor serious structure wine age couple year drink | 3 | 0.287841 | -0.102176 | ['wine', 'year', 'soil', 'couple', 'structure'] | 0.1548 | 0.865 | 0.072 | 0.063 | POSITIVE | 3 |
| 117348 | France | A dry style of Pinot Gris, this is crisp with some acidity. It also has weight and a solid, powerful core of spice and baked apple flavors. With its structure still developing, the wine needs to age. Drink from 2015. | NaN | 90 | 32.0 | Alsace | Alsace | NaN | Roger Voss | @vossroger | Domaine Marcel Deiss Pinot Gris (Alsace) | Pinot Gris | Domaine Marcel Deiss | 2012 | dry style pinot gris crisp acidity also weight solid powerful core spice baked apple flavor structure still developing wine need age drink | 3 | 0.271630 | -0.130343 | ['acidity', 'drink', 'spice', 'apple', 'wine'] | 0.5267 | 0.891 | 0.000 | 0.109 | POSITIVE | 3 |
| 117349 | France | Big, rich and off-dry, this is powered by intense spiciness and rounded texture. Lychees dominate the fruit profile, giving an opulent feel to the aftertaste. Drink now. | Lieu-dit Harth Cuvée Caroline | 90 | 21.0 | Alsace | Alsace | NaN | Roger Voss | @vossroger | Domaine Schoffit Lieu-dit Harth Cuvée Caroline Gewurztraminer (Alsace) | Gewürztraminer | Domaine Schoffit | 2012 | big rich dry powered intense spiciness rounded texture lychee dominate fruit profile giving opulent feel aftertaste drink | 3 | 0.172617 | -0.058754 | ['texture', 'drink', 'aftertaste', 'spiciness', 'profile'] | 0.7003 | 0.723 | 0.047 | 0.230 | POSITIVE | 3 |